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(57) Abstract: The invention relates to nucleosides comprising a reporter moiety which also functions to limit polymerase activity, 
^5 characterised in that the reporter moiety is attached to the nucleoside through a linkage group cleavable by a hydrolase enzyme 
^ wherein the hydrolase enzyme is selected from the group consisting of esterases, phosphatases, peptidases, penicillin amidases, 
^ glycosidases and phosphorylases. 
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NUCLEOTIDE ANALOGUES 

Field of Invention 

The present invention relates to nucleoside and nucleotide analogues. In particular, the 
invOTtion relates to nucleotide analogues comprising enzyme hydrolysable linkage 
groups attaching reporter moieties to the nucleotide. 

Background of the Invention 

Recent improvements in DNA sequencing techniques have sought to meet the 
increasing demands of large scale sequencing. Increasingly, methods in which the 
template nucleic acid molecules are attached to a solid surface are being developed 
(see, for example, US 5,302,509 and US 5,547,839). Such methods dispense with the 
need for an electrophoretic separation step and, with the use of optical detection 
technologies (see, for example, Nie et al Annu. Rev. Biqphys. Biomol. Struct 1997, 
26: 567-96), aim to allow sequencing information at the level of a single molecule to 
be obtained. This has the jfurfher potential for multiple samples to be analysed 
simultaneously. 

One example of such methods is Base Addition Sequencing Scheme (BASS) (see, for 
example, Metzk^ et al. Nucleic Acids Res 1994, Vol.22, No.20; p. 4259-4267). 
BASS is a method involving die incorporation of nucleotide analogues which have 
been modijSed so as to comprise a blocking group which terminates DNA synthesis. A 
primer is aimealed to a template bound to a solid support and sequence data obtained 
by repetitive cycles of iocorporation of modified nucleotides. At each cycle, the 
incoiporated base is identified in situ before being deprotected to remove the blocking 
group and allow the next cycle of DNA synthesis. 

Methods such as BASS rely on the use of nucleotide analogues that possess 
polymerase enzyme blocking (or terminator) groups at the 3' hydroxy! position of the 
sugar on the nucleotide. Typically, the blocking group is a combined terminator and 
label/reporter moiety such that the incorporated nucleotide can be detected while the 
bulky label or reporter moiety itself fulfils the role of blocking a polymerase fix)m any 
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further DNA synthesis. Conveniently, as the terminator group is also the reporter 
moiety, a single reaction allows simultaneous removal of both functions thus allowing 
subsequent DNA synthesis and for incorporation of the next base to be read. 

hi order to allow subsequent rounds of DNA synthesis, these polymerase enzyme 
blocking groups are, typically, attached to the nucleotide via a linking groiq) in such a 
way that they can be removed. However, conventional sequencing strategies require 
high temperatures of cycling (typically approximately 95°C or above) which are 
associated with pH changes in the reaction mixture. Such conditions can cause 
reactivity of certain chemical bonds. Accordingly, the coiq)ling methods for attaching 
blocking and labelling groups to nucleotides which have bera used to date have 
focused on usmg those linking groups which can withstand changes in chemical 
conditions (such as tenq)erature and pH). For example, the blocking and label gioiq>s 
can be attached via photosensitive linkage groups and thus cleavable by light 
irradiation (i.e. photochemical means, see, for exan^le, WO 93/05183) or via 
chemical means. 

WO 01/92284 desaibes nucleotides which conq)rise both a reporter moiety and a 
polymerase blocking group in which the reporter moiety does not also act as a 
polymerase enzyme blocking group. The polymerase razyme blocking group is 
attached to the sugar by means of an enzyme cleavable linker. 

Labeled reagents for nucleic acid sequencing are disclosed in EP 0252683 . Igloi 
(BioTechniques, 1996,21, 1084-1092) reports nucleotide-dye derivatives with utility 
in sequencing. In both cases, stable propynyl amide linking groups are used to attach 
the fluorescent dye to the base of these reagents. 

Snnilarly, stable propynjd Hnkmg groups are used in dye-labelled nucleotides in WO 
97/ 00967 and WO/ 88 10264. 



The use of known nucleotide analogues suffers from a number of disadvantages. 
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Known methods of removing the reporter/tenninator groups require repeated insult by 
reactive chemicals or irradiation which can result in damage to the template DNA 
strand through reactions such as base transformation, crosslinking, or depuiination. 

5 Furthermore, by attaching the bulky reporter moiety in the 3 * position of the 

nucleotide, the abiUty of the DNA polymerase to recognise or tolerate the nucleotide 
may be reduced. In addition to being poorly incorporated, modified nucleotides may 
be inactive (i.e. not incorporated), inhibitory (i.e. inhibit DNA synthesis) or may result 
in an alteration of the polymerase enzyme fidelity. 

10 

Any one of, or a combination oi^ these effects will resuU in a reduced accuracy in flie 
sequence data obtained and, in particular, a decreased signal-to-noise ratio will be 
found on detection. Moreover, this means that the amount of sequence data that can 
be obtained firom successive rounds of enzyme incorporation and cleavage is limited. 
15 For example, if a combined error of qjproximately 3% in incorporation and cleavage 
were to accumulate, the result would be that sequence could only be obtained fixnn 5 
bases or fewer of the traiplate DNA before the decreased signal to noise ratio made 
fiirther sequencing impractical. 

20 Accordingly, there is a need for improved nucleotide analogues. Such analogues may 
have one or more of the following attributes: tolerated by polymerases; stable during 
the polymerization phase; and the reporter groups can be removed efiSciently under 
conditions which minimise damage to the template strand or template-primer 
complex. Preferably, the improved analogues display more than one of these features 

25 and most preferably they display all of these features. 

It is thus an object of the invention to provide nucleoside and nucleotide analogues to 
which reporter moieties, which also limit polymerase-mediated incorporation, are 
attached via enzyme hydrolysable linkage groups. It is also an object of the invention 
30 to provide nucleoside and nucleotide analogues comprising reporter groups which are 
removable on enzyme hydrolysis of a labile moiety attached to the linkage groups of 
the analogues. Such nucleotide analogues are most suitable for using in sequencing 
reactions which involve an isothermic reaction and therefore do not involve exposure 
of the nucleotide analogues to high temperatures and to undesirable variations in 
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chemical conditions. Under the conditions of suitable sequencing reactions, including 
array-based sequencing technologies (such as BASS), enzyme-cleavable groups will 
be essentially stable. The use of enzyme hydrolysable linking groups removes the 
need for harsh, template-damaging treatments to remove the reporter moieties. 

5 

Description of tbe Invention 

The present invention describes the use of linkage groups, cleavable by enzyme 
hydrolysis, to attach reporter moieties to nucleosides and nucleotides. 

10 

Accordingly, in a first aspect, the invention provides a nucleoside comprising a 
reporter moiety which also functions to limit polymerase activity, characterised in that 
the reporter moiety is attached to the nucleoside through a linkage group cleavable by 
a hydrolase enzyme , wherem the hydrolase enzyme is selected from the groiq> 
15 consisting of esterases^ phosphatases, peptidases, penicillin amidases^ glycosidases 
and phosphorylases. 

In contrast to enzyme cleavable nucleosides of the prior art, the reporter moiety serves 
the dual function of providing a detectable label and of limiting polymerase enzyme 
20 activity. Thus nucleosides of the present invention do not comprise a separate 
reporter moiety and a separate polymerase enzyme blocking group. 



25 



A hydrolase is defined as any member of the class of enzymes that catalyse the 
cleavage of a chemical bond with the addition of water. 



la a second aspect, the invention provides a compound of Formula I: 
R- LI - L2- L3 - L4 - L5 -BASE- SUGAR 
30 (I) 



wherein R is a reporter moiety 
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LI and L5 are optional linkage groups each containing one or more 
atoms comprising hydrocarbon chains which may also contain other 
atoms such as N, O and S. 

5 L2 and 14 are optional linkage groiips comprising 1 or more amino 

acid residues. 

L3 is a linkage group that is susceptible to enzymic hydrolysis by a hydrolase enzyme, 
wherein hydrolytic cleavage may be within the group or adjacent to the group and 
10 characterised in that the hydrolase enzyme is selected from the group consisting of 
esterases, phosphatases, peptidases, penicillin amidases, glycosidases and 
phosphorylases. 

Suitably, enzymic hydrolysis of the linkage group L3 produces an unstable moiety 
15 which imdergoes chemical hydrolysis. 

Suitable bases comprise purines or pyiimidines and, in particular, any of flie bases A, 
C, G, U and T or analogues thereof. 

20 Suitably, the sugars comprise ribose or deoxyribose or analogues thereof. Thus 
ribonucleotides and deoxyribonucleotides are envisaged together with ofheac 
nucleoside analogues. 

Suitably, a mono-, di- or tri- phosphate group is attached to the sugar. In a 
25 particularly preferred embodiment, a triphosphate group is attached to the sugar. 

In a preferred embodiment, the composite linker group LI to L5 may be a chain of 10 
to 200 bond lengths and may include atoms selected from carbon, nitrogen, oxygen 
and sulphur atoms, the linker group may be rigid or flexible, unsaturated or saturated 
30 as is well known in the field. The composite linker group may further incorporate 
one or more amino acids joined by peptide bonds. The incorporation of amino acids 
can be through the incorporation of amino acid monomers or oligomers using 
standard amino acid chemistry (see, for example, "Synthetic Peptides - A Users 
Guide" Ed. G. A. Grant; 1992). 



wo 03/048178 



PCT/GBG2/05375 



6 



Hydrolysis of L3 by a hydrolytic enzyme selected from the group consisting of 
esterases, phosphatases, pq)tidases, glycosidases, penicillin amidases and 
phosphorylases results in the polymerase enzyme reporter moiety (R) becoming 
5 detached from the compound. 



In a preferred embodiment, the linkage group L3 is cleavable by hydrolase enzymes 
selected from the group consisting of esterases, phosphatases, peptidases, 
glycosidases and phosphorylases. Preferred peptidases include subtilisin, proteinase 
10 K, elastase, nqjrilysin, thermolysin, papain, plasmin, tr3psin, enterokinase and 
urokinase. Suitable enzymes are those that are reactive under mild conditions (see 
Handbook of Proteolytic Bazymes, Barrett et al, ISBN 0-12-079370-9), M a 
particularly preferred embodimrat, the enzyme-hydrolysable groiq> is cleavable by 
penicillin amidase. 

15 

Preferably, L3 is a pejptide selected from the group consisting of alanine-alanine- 
alanine, alanine-alanine-leucine, glycine-leucine-serine, glycine-serine-alamne- 
alanine-leucine and glycine-alanine-glycine-leudne. 



20 Esterases catalyse the general reaction set out below in Reaction Scheme 1: 
Reaction Scheme 1 



25 



R — C Esterase * R — C + H O C R 

O — C R O — H 



Thus, in a fiulher preferred embodiment of the second aspect, L3 comprises an ester 
group. 



30 



Non-specific esterase activity is associated with a number of enzyme systems. This 
activity has been associated with both physiological fimction and drug metabolism. 
Such a non-specific carboxylesterase activity can be used to modify molecules in 
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vitro. However, the lack of stability of carboxyesters at moderately high pH and 
elevated temperatures can make them unsuitable for generating stable reagents for 
nucleic acid apphcations. 

In a preferred embodiment highly stable peptide bonds are utilised as specifically 
cleavable groups. The linkage groups may be digested with a suitable peptidase as 
shown in Reaction Scheme 2 to remove the reporter moiety without damaging the 
template strand or the template/primer complex. Following deprotection, further 
DNA synthesis can take place leading to the next cycle of labelled analogue addition. 

Where R' and R" both represent one or more amino acid residues, then peptidases 
catalyse the following general reaction set out in Reaction Scheme 2: 



15 Reaction Scheme 2 



R' N- 



R" O 



H H 



R'" O 



H H 



Peptidase 




20 



Suitably, enzymic hydrolysis of the linkage group L3 produces an unstable moiety 
which undergoes chemical hydrolysis. 
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A linker may be assembled such that L3 comprises a moiety that is not part of the 
linker backbone and is not directly bonded to the base or the reporter groups (through, 
for example, linkage groups L2 or L4), which may be enzymatically hydrolysed to a 
chemically unstable form. Thus, on removal of such a moiety by a hydrolytic 

5 enzyme, the unstable linkage group formed rapidly undergoes chemical hydrolysis to 
facilitate cleavage of the linker. This is distinct Jfrom direct enzyoaiatic cleavage of the 
linker by the enzyme in that the covalent bonds broken by the enzyme do not form 
part of the contiguous chain of covalent bonds comprising the linker between the 
nucleoside and the label. An example of this type of "remote cleavage" would be 

10 when L3 consists of an N-phenylacetyl amino acetal. The enzyme penicillin amidase 
specifically recognises the phehylacetyl portion of the molecule arid hydiolyses the 
amide bond, giving phenyl acetic acid and a hemiaminal (see, for example, WO 
97/20855). The hemiaminal is highly susceptible to acid or base catalysed hydrolysis 
in the absence of the enzyme. Thus if L3 comprises such a unit, treatment with 

15 penicillin amidase would render the linker chemically unstable, resulting in cleavage 
on base or acid catalysed hydrolysis. This is dqpicted in Reaction Scheme 3: 



Reaction Scheme 3 



O 



HO 




NH 



Penk5Dnn amidase Add or Base 1 — OH 

i ' 



Otiier systems that work on this principle could be devised and incorporated into the 
design of any linker. The particular advantage of this type of linker cleavage is that 
the enzyme recognition and cleavage site is remote from the linker cleavage site and 
may be less affected by the proximity of the other components of the linker, the label 
25 or the nucleoside. 
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Accordingly, in a particularly preferred embodiment, L3 comprises a penicillin 
amidase cleavage site. Penicillin amidase is also known as penicillin aminohydrolase 
(EC 3.5.1.11). 

5 Suitable methods for attaching a linker comprising an enzyme cleavable group to a 
base moiety are described, for example, in Cavallaro et al Bioconjugate Chem. 2001, 
12, 143-151. Further methods are described in Langer et al, Proc Natl Acad Sci USA, 
1981, 78, 6633-6637; Livak et al. Nucleic Acids Res, 1992, 20, 4831-4837 and 
Gebeyehu et al. Nucleic Acids Res, 1987, 15, 4513-4534. 

10 

A suitable reporter moiety, R, may be any one of various known reporting systems. It 
may be a radioisotope by means of which the nucleoside analogue is rendored easily 
detectable, for example ^^P, ^^S incorporated in a phosphate or thiophosphate or H 
phosphonate group or alternatively or ''^C or an iodine isotope. It may be an 
15 isotope detectable by mass spectrometry or NMR. It may be a signal moiety e,g. an 
enzyme, hapten, fluorophore, chromophore, chemiluminescent group, Raman label, 
leucodye, electrochemical label, or signal compound ad^ted for detection by mass 
spectrometry. 

20 Jna preferred embodiment, the reporter moiety has fluorescent properties and can be 
detected using a sensitive fluorescence detector. It may be a fluorophore, for exanople, 
selected from fluoresceins, rhodamines, coumarins, BODBPY® dyes, phenoxazine 
dyes, cyanine dyes and squarate dyes (described, for example, in WO 97/40104). 
Preferably, the dyes are acridone derivatives, as described in patent £^lications GB 

25 01 13435.2 and GBOl 13434.5. Most preferably, the reporter moiety is a cyanine dye. 
The Cyanine dyes (sometimes referred to as "Cy dyes™'*), described, for example, in 
US Patent 5,268,486, are a series of biologically compatible fluorophores which are 
characterised by high fluorescence emission, environmental stability and a range of 
emission wavelengths extending into the near infra-red which can be selected by 

30 varying the internal molecular skeleton of the fluorophore. 

In a preferred embodiment, the modified nucleotide remains compatible with 
elongation enzyraology, i.e. it can still be incorporated by a polymerase. Procedures 
for selecting suitable nucleotide and polymerase combinations will be readily adapted 
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from Metzker et al. Nucleic Acids Res 1994, Vol. 22, No. 20, 4259-4267. In 
particular, it is desired that a selected polymerase be capable of selectively 
incorporating a nucleotide. 

5 In another preferred embodiment, the reporter group, R, restricts further elongation of 
the polymer to a Umited number of additions by a polymerase once the nucleotide of 
the present invention has been incoiporated by a selected polymerase in specified 
polymerase enzyme conditions. 

10 The invention further provides a chemical intermediate selected from the group 

consisting of: 5-N-(N-Trifluoroacetyl-p-alanyl)propargylamino-2'-deoxyuridine; 5-N- 
(p-alanyl) propargylamino-2'-deoxyuridine; 5"N-(N-Fluorenylmethyloxycarbonyl- 
Gly-Gly-Leu-p-alanyl)pn>pargylamino-2*-deoxyuridine; 5-N-(-GIy-Gly-Leu-|J- 
alanyl)propargylamino-2'-deoxyuridine and S-N-[N-(6-Fluorescein-S(and- 

15 6)carboxamidohexanoyl)-Gly-Gly-Leu-P-alanyl]-propargylamino-2'-deoxyur^ 

In a third aspect , the invention provides a chemical intermediate of Formula II 



R-L1-L2-L3-L4-L5 



20 



wherein 



R is a reporter moiety 



25 



LI and L5 are optional linkage groups each containing one or 
more atoms comprising hydrocarbon chains which may also 
contain other atoms such as N, O and S, 



30 



L2 and L4 are optional linkage groups comprising 1 or more 
amino acid residues, and 
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L3 is a linkage group that is susceptible to enzymic hydrolysis by a peptidase enzyme. 
Chemical intennediates of Fomula n are of use as dye-linker groups in the synthesis 
of compounds of Formula I. 

5 In a preferred embodiment, L3 is selected jfrom the group consisting of alanine- 
alanine-alanine, alanine-alanine-leucine, glycine-leucine-serine, glycine-serine- 
alanine-alanine-leucine and glycine-alanine-glycine-leucine. 

Preferably, the reporter R (or dye) is selected from the ffoxxp consisting of of 
10 fluoresceins, rfaodamines, coumarins, BODIPY® dyes, phenoxazine dyes, cyanine 
dyes, acridone dyes and squarate dyes. 

In a fourth aspect, the invention provides a compound comprising 5-iV-[iV-(6- 
Fluorescein-5(and-6) carboxamidohexanoyl)-Gly-Gly-Leu-P-alanyl]-propargylamino- 
15 2'-deoxyuridine triphosphate. 

In a fifOi aspect, the invention provides a set of nucleotides charact^sed in that the 
set contains at least one compound of Formula 1 having a mono-, di- or triphosphate 
group attached to the sugar. Preferably, the set comprises each of the four natural 
20 bases A, G, C and T and analogues thereof 

In a preferred embodunent of the fifth aspect the set of nucleotides imtiier comprises 
at least two compounds as described above having different bases, characterised in 
that each compound has a different reporter moiety, R. Thus, for example, the set of 
25 nucleotides may comprise compounds with bases A and G ,wherein the compound 
with base. A, has a first reporter moiety (R}) and the confound with base, G, has at 
second reporter moiety (R^), and the first and second reporter molecules are 
distinguishable fix>m each other. 

30 In another preferred embodiment of the fifth aspect, the set of nucleotides comprises 
four compounds as described above characterised in that each compound has a 
different base such that each of the bases A, G, C and T, or analogues thereof, are 
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present and each of the four compounds has a reporter moiety which is distinguishable 
from the reporter moiety of each of the compounds having the other three bases. 



In a sixth aspect, the invention provides a method for nucleic acid molecule. 
5 sequencing comprising the steps of : 

a) immobilising a complex of a primer and a template to a soUd phase 

b) incubating with a polymerase in the presence of a compound of Formula I having 
a mono-, di- or triphosphate group attached to the sugar. 

10 hi one embodiment of the sixth aspect, the complex of primer and template can be 
preformed by incubation under ^ropriate hybridisation conditions before 
immobilising the complex onto a solid phase. In another embodiment, the prim^ or 
the template can be immobilised onto a solid phase prior to formation of the coniplex 
by introduction of the appropriate hybridisation partner (i.e. t^plate or primer, 

IS respectively). In yet another embodiment, the complex immobilised onto the solid 
phase can be a single nucleic acid molecule comprising both '^primer" and 'template'*; 
for example, Ae immobilised poly- or oUgo- nucleotide can be a hairpin stmctuie. 

Suitable polymerases are enzymes that perfoim template-dependent base addition . 

20 including DNA polymerases, reverse transcriptases and RNA polymerases. Suitable 
native or engineered polymerases include but are not lincuted to T7 polymerase, the 
Klenow fragment of coli DNA polymerase I which lacks 3'-5'exonuclease activity, 
E, coli DNA polymerase HI, Sequenase™, ^29 DNA polymerase, exonuclease-fiee 
Pfii, exonuclease-free Vent™ polymerase, Thermosequeoase, Thermosequeoase H 

25 Tth DNA polymerase, Tts DNA polymerase, MuLv Reverse transcriptase or HIV 
reverse transcriptase. The selection of an appropriate polymerase depends on the 
interaction between a polymerase and the specific modified nucleotide (as described 
by Metzker ei al. Nucleic Acids Res 1 994, Vol.22, No.20; p. 4259-4267). 

30 Nucleotides comprising hydrolase-cleavable linkage groups such as carboxyl ester 
attachment groups are suitable for use in sequencing reactions used in array based 
sequencing, such as BASS. Such reactions are isothennic, unlike cycle sequencing, so 
allowing much better control of reaction conditions. In particular, the sequencing 
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reaction takes place at relatively low temperatures (typically less than 70^C) thus 
enabling enzyme-cleavable Hnkage groups, such as the carboxyl ester attachment, to 
remain stable xmder these sequencing reaction conditions. Accordingly, polymerases 
which may be useful in the sixth aspect of the invention include thennostable 
5 polymerases and non-thermostable polymerases. 

In a preferred embodiment of the sixth aspect, the method further comprises the steps 
of: 

c) detecting flie incorporation of a compound of Formula I having a mono-, di- or 
10 triphosphate group attached to the sugar 

d) incubating in the presence of enzyme under suitable conditions for enzymatic 
cleavage of the enzyme-cleavable groiq) L3 

Preferably, the hydrolase enzyme is selected fiom the group consisting of esterases^ 
15 phosphatases, peptidases, penicillin amidases, glycosidases and phosphorylases. 

Suitable conditions for enzyme hydrolysis of the enzyme-cleavable groups will 
depend on tiie nature of the enzymes involved. Enzymes such as carboxyesterases are 
active und^ a broad range of conditions and do not require co-&ctors. Conmiercially 
20 available carboxyesterases will hydrolyse esters under mild pH conditions of between 
pH 7.0 and pH 8.0. e.g. O.IM NaCl, 0.05M Tris.HCl, pH 7.5. 

Suitable peptidases may be selected fix>m tiie group consisting of subtilisin, proteinase 
K, elastase, nq>rilysin, thermolysin, p^ain, plasmin, trypsin, enterokinase and 
25 urokinase. Suitable conditions for cleavage by peptidases are exemplified in Example 
5 below. 

In another embodiment of the sixth aspect, the method further comprises: 

e) repeating steps a)-d) 

30 

In a preferred embodiment of the sixth aspect, the enzyme in step d) is penicilUn 
amidase. 
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In a preferred embodiment of the sixth aspect, the incorporation of the compound is 
determined by the detection of a single reporter group attached to the compound. 

Briefly, sequencing reactions using modified nucleotides in accordance with the 
5 second aspect of the invention may be perfonned as follows. Primer template 

complexes are immobilised to a solid surface and contacted with modified nucleotides 
in the presence of a suitable buffer also containing a polymerase, such as Klenow 
fragment of E. coli DNA polymerase I which lacks 3'-5'exonuclease activity, and a 
commercially available pyrophosphatase. The reaction is incubated under suitable 

10 conditions for a polymerase-mediated base addition reaction followed by the removal 
of non-incorporated nucleotides and enzymes by washing with a wash buffer. 
Suitably, fbe wash buffer contains a buffering agent, such as an organic salt, to 
maintain a stable pH of approximately pH 6 to pH 9 and possibly also comprises 
monovalent or divalent cations and a detergent so as to eliminate non-covalently 

15 bound molecules from the solid surface. Where the modified nucleotides comprise a 
fluorescent reporter molecule, incorporated nucleotides are detected by measuring 
fluorescence. Following identification, the templates are contacted with a buflfered 
solution containing an excess of a protein displaying the appropriate enzyme activity 
and incubated under conditions for enzyme cleavage activity. For example, \i^ere flie 

20 euzyme-cleavable group linking the reporter moiety to the nucleotides is a peptide 
group, the solution contains an excess of a protem displaying peptidase activity. 
Following enzyme activity, the soluble products of enzymatic cleavage are eliminated 
by washing as above. Following the washing step, the immobilised template is 
washed with an excess of buffer used for the polymerase reaction and the steps of 

25 polymerase-mediated base addition, detection of incorporated nucleotide and enzyme- 
cleavage activity are repeated to obtain further sequence data. 

Specific Description 

30 For the purposes of clarity, certain embodiments of the present invention will now be 
described with reference to the following figures and examples: 

Figure 1 (Example 1) shows a reaction scheme for synthesising a compound of 
Formula 1. 
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Figure 2 (Example 2) depicts enzyme cleavage of a compound of fonnula L 

Figure 3 (Example 3) shows a reaction scheme for the synthesis of a nucleotide with a 

penicillin amidase cleavable linker. 

Example 4 describes the incorporation of a compound of Formula I by a DNA 
5 polymerase. 

Example 5 describes protease mediated cleavage of FaniHex-GGL-P-A-2*dU 
Example 6 describes the preparation of dye-labelled nucleosides with protease 
cleavable linkers 

10 Example 1 

A reaction scheme for the synthesis of an example of a compound of Fonnula I 
containing a peptide-based linker is set out in Figure 1 . 

15 i) S-J\^fJ\r-Trifluoroacetvl-p>alanvnpropargvlamino-2*-4^^ 

5-Propargylamino-2'-deoxyuridine (1) (1.3g, 4.6rQmol) and JV^trifluoroacetyl-P- 
alanine succinimidyl ester (1.29g, 4.6mmol) were dissolved in DMF (lOmL) at 
ambient temperature. Triethylamine (0.46g, 0.6mL, 4.6mmol) was added and the 

20 solution stirred overnight at ambient temperature. The solvent was then removed 
under vacuum. The residue was then re-dissolved in dichloromethane : methanol 
(1:1) and eluted througji a flash silica gel column (dichloromethane : methanol, 9:1). 
Removal of solvent from the appropriate fractions (R/-0. 1, dichloromethane : methanol 
9 :1) afforded the title compound as a white foam (0.7g, 34%). NMR (300MHz, 

25 dg-DMSO) 5 1 L62 (IH, s, N^-H), 9.48 (IH, t, br, CFsCOJJH), 8.48(1H, t, /5.4Hz, 
propargyl NH), 8.15 (IH, s, H-6), 6.09 (IH, app t, J5.6Hz, H~r), 5.25 (IH, d, J4.5, 
3'-0H), 5.10 (IH, t, y4.5H2, 5'-0H), 4.22 (IH, m, H-40, 4.07 (2H, d, JS.lHz, 
propargyl CH2), 3.77 (IH, m, H-30, 3.61 - 3.51 (2H, m, H-5'), 3.40- 3.33(2H, P-ala 
CH2CONH part obs. by HDO), 2.37 (2H, t, J6.9Hz, p-ala CH2NHTFA), 2.10 (2H, dd, 

30 /4.8Hz, H.20; "C NMR (75.45MHz, de-DMSO) 5 169.37, 161.61, 156.19 (q, / c-f 
35.9Hz), 115.85 (q, /c-f 286.5Hz), 98.06, 89.45, 87.62, 84.70, 74.44, 70.23, 70.13, 
61.02, 35.78, 33.80, 28.58, 25.20, 8.75; MS (ES+) m/z 449 (M+H)^ 466(M+H20)^ 
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ii) S-A/^(P-alanvl)propargvlammo-2'-deoxvuridinef3^ 



5-;Vi(7/-Trifluoroacetyl-p-alanyl)propargyIaiiimo-2'-deoxyuri (2) was dissolved 
in concentrated aqueous ammonia at ambient temperature. The solution was allowed 
5 to stir overnight at ambient temperature. The solvent was then removed under 
vacuum to give the title compound as a pale yellow foam (0.62g, 100%). NMR 
(300MHz, ds-DMSO) 8 8.61 (IH, t, 75.1Hz, piopargyl NH), 8.16 (IH, s, H-6), 6.09 
(IH, app t, y6.6Hz, H-1 4.21 (IH, m, H-4'), 4.10 (2H, d, JS.lHz, propargyl CH2), 
3.79 (IH, m, H-30, 3.59 (2H, m, H-5'), 2.96 (2H, t, J6.9Hz, P-ala CH2), 2.44 (2H, t 
10 partly obs., p-ala CH2), 2.10 (2H, m, H.2'); ^^C NMR (75.45MHz, dg-DMSO) 8 
169.23, 161.61, 149.41, 143.76, 97.98, 89.25, 87.64, 84.74, 74.60, 55.46, 35.31, 
32.36, 28.62, 25.20; MS (ES+) m/z 353(M+H)^ 

iii) 5"iV^OV-Fluorenylmethvloxycarbonvl-Glv-GlV"Leu-B-alanvlVropar^ ^ 
15 2'-deoxvuridine (4) 

5-;V-(P-alanyl)propargyIamino-2'-deoxyuridine (3) O.lg (0.28mmol) and iV- 
fluorenylmethyloxycarbonyl-GIy-Gly-Leu succinimidyl ester (0.1 7g, 0.3mmol) were 
weighed into a round bottom flask and then dissolved in anhydrous DMF (ImL). 

20 Triethylamine (0.061 g, O.OSmL, 0.6nmiol) was then added and the reaction mixture 
stirred at ambient temperature. After 2.5 hours the solvents were removed under 
vacuum and the residue redissolved in dichloromethane - methanol (9:1) and eluted 
with dichloromethane - methanol (9: 1 then 8:2). Collected and removed solvent fiom 
fractions containing R^O.5 material (8:2 dichloromethane : methanol). Obtained the 

25 title compound as a white solid 0.04g (1 8%). ^H NMR (300MHz, de-DMSO) 5 1 1 .60 
(IH, s, br, N^-H), 8.38 (IH, t, amide NH), 8.15 (IH, s, H-6), 7.95 (IH, t, amide NH), 
7.90 (IH, d, amide NH), 7.88 (2H, d, J7.5H2, Fmoc), 7.70 (2H, d, •/7.5Hz, Fmoc), 
7.62 (IH, t, amide NH), 7.41 (2H, t, J7.5Hz, Fmoc), 7.30 (2H, t, 77.5Hz, Fmoc), 6.09 
(IH, app t, J6.6HZ, H-1 0, 5.24 (IH, d, br, 3'OH), 5.09 (IH, t, 5'.0H), 4.29 - 4.18 

30 (4H, m), 4.05 (2H, m), 3.79 ~ 3.54 (5H, m), 2.25 (2H, m, p-ala CH2), 2.10 (2H, m, H- 
2'), 1.51 (IH, sept., leucyl CH2CH(CH3)2), 1.42 (2H, ra, leucyl CH2CH(CH3)2), 0.84 - 
0.78 (6H, 2d, leucyl CH2CH(CH3)2); MS (ES-f) m/z 802(M+H)''. 
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iv) 5-J\r'^Glv-GlY-I^u-p-alanvl'lpropargvlamino-2'«deoxvuridme 

5-7V:.(^-FluorenyImethyloxycarbonyl"Gly-Gly-Leu-P-alany 
deoxyuridine (4) was dissolved in anhydrous DMF at ambient temperature. 
5 Piperidine was tibten added and the solution stirred at ambient temperature overnight 
Solvents and volatile reagents were removed under vacuum. The product was used 
without further purification. MS (ES+) m/z 580(Mh-H)^ 

v) 5-JV'-fJV-(6-nuorescein-5fand-6^carboxamidohexanovlVGlV'Glv-Leu-P- 
10 alanvll-propargvlamino-2'-deoxvuridine (6) 

5-iV^-Gly-Gly-Leu-P-alanyl)propargylamino-2'-deoxyuridine (5) (11.2 ^mol) and 6- 
(fluorescein-5(and 6)carboxamidohexanoic acid succinimidyl ester (0.098g, 
16.8^mol) were dissolved in DMF (0.2mL) at ambient temperature. Triethylamine 

15 (3)xL, 22.4|amol) was added and the solution allowed to stand at ambient temperature 
overnight. Solvents and volatile reagents were removed under vacumn. The residue 
was washed with dichloromethane - methanol - acetic acid 90:9:1 to remove 
unreacted dye and residual triethylamine. The solid residue was then dissolved in 
methanol - water and purified by reverse phase HPLC (isocratic water - methanol 1:1 

20 / C18 stationary phase). Obtained the title compound (5.3|jmol) as an orange powder 
after lyophihsation of the appropriate fiactions. NMR (300MHz, d4-MeOH) 5 
8.47 (IH, s), 8.28 (IH, s), 8.20-8.00 (3H, m), 7.31 (IH, d), 7.10-7.00 (4H, m), 6.21 
(IH, app t, H-r), 4.6 (IH, m), 4.30 (2H, m), 4.21 (2H, d, propargyl CH2), 3.9 - 3.6 
(7H, m, glycyl a-H, H-5% leucyl a-H), 2.5 - 2.1 (6H, m, H-2% p-ala CH2, caproamide 

25 CH2CONH), 2.75 - 1.4 (12H, caproamide CH2, leucyl CH^CH(CH.)A 0.80 (6H, 2d, 
leucyl CH2CH(ai3)2); MS (ES+) nt/z 1051(M+H)''; UV X^n^ 498nm (MeOH - 
NH40H,pH9). 

vi) 5rA^"ri\^-f6-Fluorescein-5fand-6)carboxamidohexanovlVGlV"Glv-I^U"fi- 
30 alanvl]-propar|gvlamino-2*-deoxvuridine triphosphate 



5-iV'-(JV^Fluorenylmethyloxycarbonyl-Gly-Gly-Leu-p-alanyl)propargy^ 

deoxyuridine (4) may be converted to a dye-labelled triphosphate by using established 
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triphosphate syntheses (for example, see K. Burgess & D. Cook,, Chem. Rev. 2000, 
100, 2047 - 205P and references cited therein). The triphosphate of compoimd (4) 
could then be treated with piperidine under the same conditions as used to prepare 
compound (5) and then labelled as described above for the preparation of compound 
5 (6). 

Example 2: Protease Mediated Cleavage of 5-Ar-[jV^(6-Fluorescein-5(and- 

6karboxamidohexanovl)-Glv-Glv-Leu-p-danvl1>propargvlarDinO"2'-deoxvuridine 

fFamHex-GGL-B-A2'dlJ) 

10 

Figure 2 shows the hydrolytic cleavage of FamHex-GGL-P-A2'dU (€) by the 
proteolytic enzyme, subtilisin. Compotmd 6 is readily digested by subtilisin 
(Subtilopeptidase A, type Vin, Sigma Chemical Company, UK), which cleaves at the 
leucine residue, following 2 hours incubation at 37® C at pH 7,5 to yield the 
15 nucleoside and dye-labelled products shown. 

Example 3: Synthesis of a Nucleotide with a Penicillin Anoddase Cleavable linker 

Figure 3 shows a reaction scheme for the preparation of a nucleotide with a penicillin 
20 amidase cleavable linker. 

5-Hydroxymethyl-5',3'-di-C>-j7-toluyl-2'-deoxyuridine (7) (prepared using established 
procedures (T. Ueda, Chemistry of Nucleosides and Nucleotides, Vol. l.Ed. L. B. 
TowOTsend) and JV-[a-thioethyl-JV"trifluoroacetylaminopropyl benzamide] 

25 phenylacetamide (8) (Flitsch eL al. Tetrahedron Letters 1998, 39, 3819 - 3822 and 
references cited therein; Flitsch et aL WO 97/20855) may be combined in the 
presence of iV^iodosuccinimide to give compound (9). Compound (9) may be 
converted to the intermediate (10) by treatment with sodium methoxide in methanol, 
followed by ethyl trifluoroacetate in methanol. Conversion of the nucleoside (10) to a 

30 triphosphate (11) may be achieved by using established triphosphate synthesis 
conditions (For examples see K. Burgess, D. Cook. Chem. Rev. 2000, 100, 2047 - 
2059 and references cited therein). Labelling of the triphosphate witii a reporter group 
may then be achieved by exposxu-e of compound (11) to proprietary labelUng reagents. 
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such as 6-[Fluorescem-5(aDd-6)-carboxamidohexanoic acid succirdmidyl ester, in a 
suitably buffered aqueous solution to give (12). 

Example 4: DNA polymerase assays 

5 

Incorporation of fluorescentlv labelled deoxynucleotide triphosphates by DNA 
polymerases 

(i) Materials 

10 

The following ohgnucleotides (Ihteractiva Biotechnologie, Germany) w^e used for 
assaying nucleotide incorporation: 

Prima: Sequence:- 
15 . SEQ.IDNol: 

5'- TAA CTC ATT AAC AGG ATC-3' 

Template oligonucleotides:- 

20 SEQ.E)No2: 

ATTCGCGGTATTCTGGTATGAAGCTTTTAGATCCTGTTAATGAGTTAGTA 
SEQ.ro No 3: 

ATTCGCGGTATTCTGGTATGAAGCTTTAAGATCCTGTTAATGAGTTAGTA 

25 

SEQ.ro No 4: 

ATTCGCGGTATTCTGGTATGAAGCTTAAAGATCCTGTTAATGAGTTAGTA 
SEQ.roNoS 

30 ATTCGCGGTATTCTGGTATGAAAAAAAAAGATCCTGTTAATGAGTTAGTA 
Fluorescently labelled nucleotides were obtained from the following sourcesr- 



Molecular Probes hic: 
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Alexa Fluor 546-14-dUTP ImM in TE 
Alexa Fluor 568-5-dUTP IniM in TE 
Alexa Fluor 594-5-dUTP ImM in TE 



5 Nen,UK: 

Fluorescein-12-dUTP 
Coumarin-5-dUTP 
Tetraniethykhodamiiie-6-dUTP 
Texas Red-5-dUTP 
10 lissainine-S-dUTP 

N^thofluorescein-5-dUTP 
Fluorescein Chlorotriazinyl-4-dUTP 
Pyrene-8-dUTP 

Diethylaminocoumarin--5--dUTP 

15 

Amersham Biosciences^ UK: 

CySdUTP 

CySdXJTP 

20 (ii) Oligonucleotide labelling 

Oligonucleotide 1 was labelled with [ganmia-33P]ATP (Amersham Biosciences) 
using T4 Polynucleotide Kinase (Amersham Biosciences) in accordance with the 
manufacturer's protocol. 

25 

(iii) Primer extension reactions 

Each 20fil primer extension reaction, assembled in O.Sml thermal cycling microfiige 
tube, contained at a final concentration Ix Thermosequenase ™ buffer (Amersham 
30 Biosciences), O.OSmM 33P labelled oligonucleotide 1, 0.125inM template 

oligonucleotide, 0.125mM dye-labelled nucleotide and 0.165 units of enzyme. The 
reactions were denatured at 95*^0 for 1 minute before incubation at 48°C for 45 
minutes. 
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The reactions were terminated by the addition of 5jil of stop buffer comprising 
0.1%(w/v) Xylene Cyanol, 0.1%(w/v) Bromophenol blue and 80% Formamide. The 
reactions were heated to 90°C for 3 minutes then chilled on ice. 

5 A 14% denaturing polyacrylamide gel in IxTBE (Sequagel, Flowgen Ltd, UK and 
Sigma Chemical Co., UK, respectively) was pre-run at 40W constant power for 
3Qmin. in Ix TBE running bujffer. An 8|il aliquot of denatured reaction was then 
applied to each lane and the samples electrophoresed for 1 .5 hours at 40W constant 
power. The gel was exposed to a phosphor screen (Amersham Biosciences) for 1 hour 
10 and the image detected on the Storm™ hnager (Amersham Biosciences) in 
accordance with the maufacturers guidelines. 

(iv) Results 

15 All the fluor-labelled nucleotides were incorporated by Thermosequenase. Templates 
2, 3, 4 and 5 had 1, 2, 3, or 8 consecutive A residues respectively. The number of 
consecutive fluor-labelled nucleotides added was dq)endent upon the template and the 
fluor species and is summarised below: 

20 Fluor predominant product with templates 3,4 and 5 

Cy3 One addition, with minor two additions 

Cy5 One addition, with minor two additions 

Tetramethykhodamine One addition, with minor two additions 

Coumarin One addition, with a comparable amount of template 



witii two additions 



Alexa 546-14-dUTP 



One addition, with a comparable amount of template 
with two additions 

One addition, with minor two additions 
One addition, with very shght two additions 
One addition, with very slight two additions 
One addition 



Fluorescein 



Texas Red 



Lissamine 



AIexa568-5-dUTP 



Alexa 594-5-dUTP 



One addition 
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Even when the tenaplate contained three or more consecutive dA residues 
(oligonucleotide 4 and 5), the number of nucleotides added did not noticeably exceed 
two bases. The efficiency of the second base addition appeared to be dependent on 
the label attached. Some fluors such as Coumarin appeared to demonstrate greater 
5 efficiency in adding two consecutive bases. 



Example 5: Protease mediated cleavage of 5-N-fN-f6-Fluorescein-5fand-6) 
carboxamido hexanovlVGlv-Glv-Leu-P'alanvn-propargvlamino-2'-deoxvuridine (€i 
fFamHex-GGL-p-A-2'dUt 

10 

A 10\ig aliquot of the substrate FAMHEX-GGL-P-A 2'dU (compound 6 above) was 
dissolved in O.IM sodium acetate buffer pH 7.S containing 5mM calcium acetate. 
The substrate solution was digested at ST'^C for 2 hours in 200^1 in the presence of 0.5 
units of subtihsin (Subtilopeptidase A, type Vm, Sigma Chemical Co. UK) 

15 

The resulting digest was ultrafiltered using a Microcon YMIO Concentrator (Milipore 
Ltd., UK) and the filtrate analysed by HPLC (Gilson 170, Gilson, UK) on a 
Sephasil™ peptide C-18 colimm (Amersham Biosicences, UK), Four major peaks of 
absorbance at 438 nm were observed in the substrate and product. These eluted at 
20 29.8 (I), 32.6 (II), 26.7 (EI) and 37.1 (IV) minutes on an acetonitrile gradient of 50% 
to 70%. The peaks were collected and analysed by MALDI-TOF mass spectrometry 
(Bruker Biflex II, Brucker, Germany). A matrix of lOmg/ml 3-hydroxycinnamic acid 
in acetonitrile and 1% TFA was used for the analysis. The mass spectrometer Was 
calibrated with lOOmM Bradykinin 1-7 (Sigma Chemical Co., UK). 

25 

The undigested substrate contained a predominant peak (I) that was shown by mass 
spectrometry to contain the intact substrate molecule. The remaining peaks 
corresponded to components of the intact molecule probably carried over from the 
synthesis. After substrate digestion with subtilisin, peak I was seen to diminish 
30 substantially and peak 11 demonstrated a corresponding increase in absorbance at 
438nm. The latter peak was shown to contain a molecular ion of 717 that 
corresponded to a fluor-linker moiety resulting from the hydrolysis of the substrate at 
the peptide bond on the carboxyl side of the leucine residue. These observations were 
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consistent with the cleavage of the linker by the subtilisin enzyme. There was no 
observable change in peaks III and IV, suggesting that they do not act as substrates for 
the protease enzyme, 

5 Example 6: Preparation of dve-labelled nucleosides with protease cleavable linkers 

Linker motifs suitable for use in protease cleavable linkers were prepared and 
identified using the following procedmes. 

10 (i) General LibrarvAmino Acid Coupling Procedure 

A polystyrene resin loaded with 5-propargylamino-2'-deoxyuridine was prepared 
using standard solid phase chemical methods. The resin was distributed between 21 
disposable filter vessels (10-20 mg per vessel). The filter tubes containing the resin 

15 were placed on a vacuum manifold, DCM was added to swell the resin and the excess 
was drained off. An microtag was added to the vessel for identification purposes. 
Solutions of Fmoc-AA-OH (or Fmoc-Ahx-OH), DIC and HOBt in DCM/ DMF were 
prepared. The ^propriate solution of activated amino acid was added, the reaction 
vessels, attached to the manifold, were placed horizontally on a flat bed shaker. The 

20 vessels were agitated for 3-3.5 hours. The vessels were drained of the reaction mixture 
and the resin was washed with: DMF (5 x 1ml), DCM (5 x 1ml), MeOH (5 x 1ml), 
Et20 (3 X 1ml), (volume of washings are shown per filter tube). 

(ii) General Library Fmoc Deprotection Procedure 

25 

DMF was added to the vessels to aUow the resin to swell. A&er draining oflF the DMF 
20% piperidine in DMF was added and the vessels left to agitate for 1 hour. The 
deprotection mixture was drained off and the resin was washed with: DMF (5 x 1ml), 
DCM (5 x 1ml), MeOH (5 x 1ml), Et20 (3 x 1ml). 

30 

Ammo acid coupling and Fmoc deprotection procedures were repeated with different 
combinations of activated amino acids added to each tube to generate a library of 
compounds. Fmoc-Ahx-OH was the final amino acid to be added to all portions of 
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the resin. After final Fmoc deprotection the compounds were labelled with dye and 
cleaved from the solid support according to the procedures described below. 



5 (iii) Genial Fluorphore Coupling Procedure 

To the pre-swollen resin (DMF) was added a solution of fluorescein isothiocyanate 
isomer I and NEt3 in DMF (1 ml). The resin was shaken overnight and was then 
washed with DMF (5 x 1 ml), DCM (5 x 1 ml), MeOH (5 x 1 ml) and Et20 (5 x 1 
10 ml). 

(iv) General Cleavage Procedure 

5% TFA in DCM (0.2 ml) was added to resin (8 to 14 mg) and left to stand for 2 
15 minutes. DCM (0.3 ml) was added and the resin was left to stand for 3 minutes before 
the DCM/ TFA mixture was collected. 5% TFA in DCM (0.2 ml) was added and the 
resin left to stand for 2 minutes. A drop of MeOH and DCM (0.3 ml) was added and 
the resin was left to stand for 3 minutes. This cleavage procedure was repeated (x ^ 3) 
to ensure maximum release of compound. An average loading of 0.430 mmole/g 
20 could be determined by HPLC. 

(v) Protease Cleavage Assay 

All compounds prepared were assayed with a selection of the preferred proteases to 

25 determine the most suitable linker / protease combinations. 

A solution of approxunately lOjag of nucleoside in water (5p,l) was mixed with 1 unit 
of protease (1 - 3^1 depending on enzyme stock solution) in the appropriate bufifer for 
the enzyme. Final volumes of solutions were made to 200^1. Solutions were then 
held at enzyme optimum temperature for two hours and then centrifoged through 

30 1 0,000D molecular weight size exclusion membranes [Amicon Microcon YM-1 0] to 
remove the enzyme. The appropriate protease substrates reactions and standard 
control solutions were also prepared and treated in the same manner. All solutions 
were analysed using CI 8 reverse phase h.p.l.c (0.1%TFA/water : 
0.042%TFA/Acetonitrile, 95:5 - 0:100 linear gradient). Successful cleavage of the 
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linker was observed when several new products appeared on h.p.lx., some of which 
possessed only the nucleoside chromophore. 



5 (vi) Preparation of representative dve-labelled nucleotide triphosphate with a 
protease cleavable linker 

Preparation of a dye labelled nucleotide triphosphate incorporating a protease 
cleavable linker was performed as described below after selection of a suitable linker 
10 motif from the nucleoside library. 

a) Preparation of Dve-Linker Group 

The synthesis of an example of a dye-linker group, according to Foimula II, is 
15 described below. 

Cy5 carboxylic acid (lOQmg, 0.17mmol) [Amersham Biosciences] and N,NJ^^•,N*- 
tetramethyl-0-(N-succinimidyl)uroniumtetrafluoroborate (57mg, 0.19nmiol) [Fluka] 
were weighed into an oven-dried round-bottom flask. Anhydrous dimethylsulfoxide 
20 (250^il) [Aldrich] was then added to dissolve the solids. Neat diisopropylethylamine 
(32mg, 025mmol, 43}a1) [Aldrich] was then added. The resulting solution was then 
stirred at ambient temperature for 4 hours. Thin layer chromatogr^hy (4:1 
dichloromethane : methanol) showed flie complete conversion of the starting material 
(Rf 0.2) to the N-hydroxysuccinimidyl ester (Rf 0.5). A solution of IGN- 
IS GlyGlyLeu-OH (42mg, 0. 1 7mmol) [Bachem] was prepared by heating the peptide in 
anhydrous dimethylsulfoxide at 60°C. The solution was then allowed to cool to 
ambient temperature and was fhea added immediately to the solution of flie Cy5- 
carboxylic acid N-hydroxysuccinimidyl ester. The reaction mixture was stirred at 
ambient temperature overnight. Analysis by thin layer chromatography (4:1 
30 dichloromethane : methanol) showed the complete conversion of the active ester (Rf 
0.5) to a new product which was immobile on thin layer chromatography. The 
majority of the solvents were then removed under vacuum to give the crude product 
as a daric blue oil. The oil was redissolved in methanol (SO^il). Half of the solution 
was then added to the top of a flash silica gel column and then eluted with 79:20:1 
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dicMoromethane : methanol : acetic acid. Fractions were collected and those 
containing the predominant blue coloured component were pooled. The solvrat was 
then removed under vacuum to give the pure Cy5-GlyGlyLeu conjugate as a dark blue 
solid. 6h(CD30D, 300MHz) 8.3(2H, 2t, vinylic), 7.8(2H, m, aromatic), 7.6-7.3(5H, 
5 m, aromatic), 6.6(1H, t, vinylic), 6.4(1H, d, vinyUc), 6.2(1H, d, vinylic), 4.4(1H, m, 
leucine a-H), 4.1(2H, dd, -C4H8CH2CONH), 4.0(1H, d, Gly a-H), 3.8(2H, s, Gly a- 
H), 3.7(1H, d, Gly a-H), 3.6(3H, s, CH3N+), 2.4(2H, dd, CH2N), 2.0- 1.5(9H, m, 
4xCH2, Leu CH(CH3)2), 0.85(6H, s. Leu CH(CH3)2); 5c (CD30D, 75.45MHz) 
177.5, 176.2, 174.3, 172.2, 156.6, 155.0, 145.9, 143.3, 143.1, 142.7, 142.1, 129.9, 
10 128.0, 127.3, 126.9, 123.5, 121.2, 112.7, 110.8, 105.7, 104.1, 56.8, 55.8, 51.0, 45.2, 
43.7, 42.9, 36.3, 31.4, 28.3, 27.8, 27.3, 26.1, 23.8, 22.2, 18.0, 13.1; Xmax 642nm. 

b) Coupling of Dve-Linker Group to Nucleoside 

15 Cy5-GlyGlyLeu-0H (3.1mg, 3.9Mmol) and N,NJ^I*,N'-tetramethyl-a.(N- 

succinimidyl)uronium tetrafluoroborate (1 .4mg, 4.7tmiol)[Fluka] were weighed into a 
ImL plastic tube. Anhydrous dimethylsulfoxide (50^tL) [Aldrich] was then added, 
followed by diisopropylethylamine (0.74mg, 5.8|iniol, l|al) [Aldrich]. The reaction 
mixture was agitated and then allowed to stand at ambient temperature for 1 hour. A 

20 solution of 5-allylamino-2 '-deoxyuridine-5 '-triphosphate triethylammonium salt 
(3,8mg, 4.1 imiol) in anhydrous dimethylsulfoxide (50|il) was added. The solution 
was agitated and then allowed to stand at ambient temperature. Monitoring of the 
reaction by reverse phase h.p.Lc showed the presence of a small amount of new 
material. The reaction mixture was diluted with O.IM triethylammonium bicarbonate 

25 (TEAS) buffer and lyophilised to give a gummy residue. The residue was then re- 
dissolved in water (200^1) and then eluted through a preparative reverse phase h.p.l.c 
column (O.IM TEAB - Acetonitrile, 95:5 - 0: 100 linear gradient over 30 minutes). 
Fractions containing the new material were collected and lyophilised. Further 
purification was performed by ion-exchange h.p.l.c to give the desired product as a 

30 blue foam after lyophilisation. 8h (D20, 300MHz) 7.9 - 7.1 (8H, m, aromatic), 6.4 - 
5.9 (5H, m, vinylic), 6.1 (IH, obs, H-1 '), 4,3 - 3.0(9H, m), 2.2(2H, m, H-20, 2.0 - 1.5 
(3H, m), 0.8 (6H, m); 5P (D20, 300MHz) -^.3, -11.1, -21.7; 644nm. 
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(vii) Cleavage of liiiker with Subtilisin 

A solution of approximately lOng of nucleoside in water (5\iL) was mixed with 1 unit 
5 of subtilisin (2jil of 0.5U enzyme stock solution) in the appropriate buffer for the 
enzyme (193|li1). Solutions were then held at enzyme optimum temperature for two 
hours and then centrifuged through 10,000D molecular weight size exclusion 
membranes [Amicon Microcon YM-10] to remove the enzyme. The appropriate 
subtilisin substrate reaction and standard control solutions were also prepared and 
10 treated m the same manner. The filtrates ware analysed using CI 8 reverse phase 
h.p.l.c (0.1%TFA/water : 0.042%TFA/Acetonitrile, 95:5 - 0:100 linear gradient). 
Successful cleavage of the linker was observed when new products at 4.2 and 25^ 
minutes (starting material retention time = 24.2 minutes). The peak at 4.2 minutes 
contained only the nucleotide chromophore. 



. 03/0481 78 PCT/GB02/05375 

28 

Claims 



1 . A nucleoside comprising a reporter moiety which also functions to Iknit 
polymerase activity, characterised in that the reporter moiety is attached to the 

5 nucleoside through a linkage group cleavable by a hydrolase enzyme, wherein said 
hydrolase enzyme is selected from the group consisting of esterases, phosphatases, 
peptidases, penicillin amidases, glycosidases and phosphorylases. 

2. A compound of Formula I 

10 

R - L 1 - L2- L3 - L4 - L5 - BASE - SUGAR 



15 wherein R is a reporter moiety 



LI and LS are optional linkage groups each containing one or more 
atoms comprising hydrocarbon chains which may also contain other 
atoms such as N, O and S. 

L2 and L4 are optional linkage groups comprising 1 or more amino 
acid residues. 



L3 is a linkage groiq) that is susceptible to enzymic hydrolysis by a hydrolase emymo, 
25 wherein hydrolytic cleavage may be within tibe group or adjacent to the group and 
characterised in that said hydrolase enzyme is selected from the gcoup consisting of 
esterases, phosphatases, peptidases, penicillin amidases, glycosidases and 
phosphorylases. 



30 



3. A compound according to claim 2, wherein enzymic hydrolysis of the linkage 
group L3 produces an unstable moiety which undergoes chemical hydrolysis. 



\/0 (^3/0481 78 PCT/GB02/05375 

29 

4. A compound according to either claim 2 or 3, wherein the base comprises 
purines or pyrimidines and, in particular, any of the bases A, C, G, U and T or 
analogues thereof. 

5 5. A compound according to any of claims 2 to 4, wherein the sugar comprises 
ribose or deoxy-ribose or analogues thereof. 

6. A compound according to any of claims 2 to 5, wherein a mono-, di- or 
triphosphate group is attached to the sugar. 

10 

7. A compound according to claim 6, wherein a triphosphate group is attached to 
the sugar. 

8. A compound according to any of claims 2 to 7, wherein the peptidase is 
15 selected fiom the group consisting of subtilisin, proteinase elastase,.neprilysin, 

thermolysin, papain, plasmin, trypsin, enterokinase and urokinase. 

9. A compound according to any of claims 2 to 8, wherein L3 is a peptide 
selected from the gjcoup consisting of alanino-alanine-alanine, alanine-alanine-leudne, 

20 glycine-leucine-serine, glycine-serine-alanine-alanine-leucine and glycine-alanine- 
glycine-leucine. 

10. A compound as claimed m any of claims 2 to 9 wherein R is a fluorophorc, 
selected from the group consisting of fluoresceins, rhodamines, coumarins, 

25 BODIPY® dyes, phenoxazdne dyes, cyanine dyes, acridone dyes and squarate dyes. 

11. A chemical intermediate of Formula n 

R-L1-L2--L3~L4-L5 

30 

(H) 



wherein R is a reporter moiety 
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LI and L5 are optional linkage groups each containing one or 
more atoms comprising hydrocarbon chains which may also 
contain other atoms such as N, O and S. 



5 L2 and L4 are optional linkage groups comprising 1 or more 

amino acid residues, and 

L3 is a linkage group that is susceptible to enzymic hydrolysis by a peptidase enzyme. 

10 12. A chemical intermediate according to claim 1 1, wherein L3 is selected from 
the group consisting of alanine-alanine-alanine, alanine-alanine-leucine, gilycine- 
leucine-serine, glycine-serine-alanine-alanine-leucine and glycine-alanine-glycine- 
leucine. 

15 13. A chemical intermediate according to either of claims 11 or 12, A^^erein R is 
selected &om the group consisting of of fluoresceins, ifaodamines, coumaiins» 
BODIPY® dyes, phenoxazine dyes, cyanine dyes, acridone dyes and squarate dyes. 

20 14. The compound 5-iV^[JV-(6-Fluorescein-5(ahd-6) carboxamidohexanoyl)-Gly- 
Gly-Leu-p-alanyl]-propargylamino-2'-deoxyuridine triphosphate. 

15. A set of nucleotides characterised in that the set contains at least one 
compound according to claim 6. 

25 

16. A set of nucleotides as claimed in claim IS comprising each of flie four natural 
bases A, G, C and T and analogues thereof. 

17. A set of nucleotides as claimed in either of claims 15 or 16 further comprising 
30 at least two compounds according to claim 6 having different bases and characterised 

in that each compound has a different reporter moiety, R. 
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18. A set of nucleotides as claimed in any of claims 15 to 17 comprising four 
compounds according to claim 6 characterised in that each compound has a different 
base such that each of the bases A, G, C and T, or analogues thereof, are present and 
each of the four compounds has a reporter moiety which is distinguishable from the 

5 reporter moiety of each of the compounds having the other three bases. 

19. A method for nucleic acid molecule sequencing comprising the steps of: 

a) immobilising a complex of a primer and a template to a solid phase 

b) incubating with a polymerase in the presence of a compound according to claim 6. 

10 

20. A method as claimed in claim 19 further comprising the steps of: 

c) detecting the incorporation of a compound according to claim 6 

d) incubating in the presence of hydrolase enzyme under suitable conditions for 
enzymatic cleavage of the enzyme-cleavable gjcoxxp L3. 

15 

21 . A method according to claim 20, wherein the hydrolase enzyme is selected •. 
fix>m the group consisting of esterases, phosphatases, peptidases, penicillin amidases, 
glycosidases and phosphorylases. 

20 22. A method according to claim 21, wherein the pq)tidase is selected from the 
group consisting of subtilisin, proteinase K, elastase, neprilysin, thermolysin, papain, 
plasmin, trypsin, enterokinase and uroldnase. 

23. A method as claimed in any of claims 19 to 22 further conoqprising: 
25 e) repeating steps a)-d)- 

24. A method according to any of claims 19 to 23, wherein the incorporation of 
the compound is determined by the detection of a single rq)orter group attached to the 
compound. 
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SEQUENCE LISTING 



<110> 


Amersham Biosciences UK Limited 


<12ff> 


Nucleotide Analogues 


<130> 


PA0191 


<140> 


GB 0128526.1 


<141> 


2001-11-29 


<160> 


,5 




IraT-Sn U J.ri Vd.&xUXl 0-± 


<210> 


1 


<211> 


18 


<212'> 


DNA 


<213> 


Artificial Sequence 


.<220> 




<223> 


synthetic oligonucleotide 


<400> 


1 • 


taactcatta acaggatc 


18 


<210> 


2 


<211> 


50 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic Oligonucleotide 


<400> 


2 



attcgcggta ttctggtatg aagcttttag atcctgttaa tgagttagta 
50 



<210> 3 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 3 

attcgcggta ttctggtatg aagctttaag atcctgttaa tgagttagta 
50 
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<210> 4 
<211> • 50 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 4 

attcgcggta ttctggtatg aagcttaaag atcctgttaa tgagttagta 
• 50 

<210> 5 
<211> 50 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic Oligonucleotide 
<400> 5 

attcgcggta ttctggtatg aaaiaaaaaag atcctgttaa tgagttagta 
50 
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